Automated PDF Scraper for Government Public Records

freelancer.com 🟡 2026-04-17

🔹 Automated PDF Scraper for Government Public Records
👤 Client: Skopje, Macedonia Member since 2018-02-08
💰 Price: $28 Average bid
🚩 Problem: Scrape government public records from portals that do not offer bulk-download options.
📦 Existing: Not specified

Specifications:

[Target] Specific government portal(s) with public record PDFs.
[Method] Python script to automate search, pagination, and PDF download.
[UI/UX] Log or CSV report for each file processed (URL, title, status).
[Stack] Python, BeautifulSoup, Selenium, Pandas.
[Security] Handle anti-bot mechanisms like throttling and captchas gracefully. Comply with terms of use.
[Format] Local folder structure: site / year / category; PDFs saved as intact files.

Workflow:

1. Define target government portal(s) and date span for records.
2. Develop Python script using BeautifulSoup for initial scraping, Selenium to handle dynamic content and captchas if necessary.
3. Implement pagination logic to navigate through all pages of results.
4. Extract PDF links from search results and download each file locally in a structured folder.
5. Generate log or CSV report with URL, title, and download status for each processed file.
6. Test script on sample data to ensure compliance and functionality.

⚡ Receive notifications instantly Join our community.

Discord Telegram

Our Social Networks

LinkedIn Twitter Facebook

🕷️️ Job Radar • SCRAPING